DISSIMILARITY MAPS ON TREES AND THE REPRESENTATION THEORY OF 

Gi„(C) 

CHRISTOPHER MANON 

Abstract. We revisit the representation theory in type Aused previously to establish that the dis- 
similarity vectors of phylogenetic trees are points on the tropical Grassmannian variety. We use a 
^^ ' different version of this construction to show that the space of phylogenetic trees Kn maps to the 

fvj ' tropical varieties of every flag variety of GL„{C). Using this map, we interpret the tropicalization of 

the semistandard tableaux basis of an irreducible representation of GLn (C) as combinatorial invariants 
r^ , of phylogenetic trees. 
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1. INTRODUCTION 

The space Kn of phylogenetic trees with n leaves was introduced by Billera, Holmes and Vogtman in 
|BHV| to give a geometric context to phylogenetic algorithms from mathematical biology. As a space, 
K„ is a fan, connected in codimension 1, with one maximal cone for each trivalent tree with n leaves. 




Figure 1. a trivalent tree with 5 leaves 

A general point T G Kn is a tree with n leaves with a specified metric, which is defined by an 
assignment of a non- negative real number i{e) to each edge e G E{T)- This defines a discrete metric 
on the set of leaves of T. The pairwise distances between the leaves form a vector (P (T) of length (2) , 
which completely determines T. 

Theorem 1.1. Ifd^{T)^ <P{T') then T = T . 

This vector is known as the 2— dissimilarity vector of 7". Not all discrete metrics come about in this 
way, so it is useful to have a theorem which classifies those which come from 2— dissimilarity vectors of 
trees, this is where tropical geometry enters the picture. Let T = RU{— 00} denote the tropical real 
line. For a polynomial / = XI C',^x'", the tropicalization is the following partial linear form. 

(1) r(/) ^ max{. ■ ■ , y^ mjXi, . . .} 

The tropical variety tr{f) associated to / is the set of all points t G T^ which make at least two linear 
forms in the expression T(/) maximum. For a polynomial ideal /, the tropical variety tr{I) is defined 
as an intersection. 



This work was supported by the NSF fellowship DMS-0902710. 

1 



CHRISTOPHER MANON 



(2) tr{I) ^ fl tr{f) 

The following theorem of Speyer and Sturmfels |SpSt| classifies those discrete metrics which come 
from 2-dissimilarity vectors of trees in Kn ■ 

Theorem 1.2. A vector v G K^^j {g ^/jg 2— dissimilarity vector of a tree T E Kn if and only if it 
is a point on the tropical variety defined by the Pliicker embedding of the Grassmannian tr{l2^n)- In 
particular, it is necessary and sufficient that for any distinct indicies i,j,k,£, two of the following 
expressions must be equal and larger than the third. 

Vij + Vki,Vik + Vji,va + Vjk 
In particular, the 2— dissimilarity vectors define a map, which is 1 — 1 and onto. 



(3) 



d^ -.Kn^ trih^n) 



This lead Pachter and Speyer jPS) to consider the ?7i— dissimilarity vectors d'^{T) of metric trees, 
which are defined in a similar manner. 

Definition 1.3. Let a C [n] be a set of m indices. For a tree T G K„ define dc{T) to be the sum of 
the lengths of all edges which appear in the convex hull of the indices a. Define d™ (7") to be the ( " ) 
vector with entries da-{T). 




Figure 2. The convex hull of three leaves 

Speyer, Pachter and later Cools conjectured a relationship between d™{T) and the higher tropical 
Grassmannian varieties. Speyer and Pachter also showed that the to— dissimilarity vector of an n— tree 
7" determines 7" if n > 2to — 1 . 

Theorem 1.4. The point d™(T) lies on the tropical Grassmannian tr{Im,n), in particular d™'{T) is a 
solution to the tropicalization tr{f) of each polynomial from the {m,n)— Pliicker ideal f € Im.n- 

This theorem was proved by the author in [Ml] and Giraldo in [G] with notably different tech- 
niques. Our solution linked the combinatorics of the tti— dissimilarity vector to the structure of the 
(to, n)— Pliicker algebra as a representation of the special linear group SLm{'C). We refer the reader to 
the book of Fulton and Harris for the basics of the representation theory of the special linear group. 
Letting loi be the first fundamental weight of SLm{C), and for a weight A let V{X) be the corresponding 
representation, we have the following expression. 



(4) 



P„: 



[Vinul] 
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In some sense this is not the natural presentation of the Pliicker algebra as an algebra with representation- 
theoretic meaning. It arrives by the first fundamental theorem of invariant theory, but a more natural 
way to obtain the Pliicker algebra is as the projective coordinate ring of Grm(C") with its structure as 
a GL„(C) variety. 

(5) Pr,^,n= V{NuJm,) 

NeZ>o 

The purpose of this note is to establish tropical properties of dissimilarity vectors of trees using 
this different representation theoretic point of view. Along the way we will show that not only the 
Grassmannian varieties, but every variety with GL„(C)-symmetry carries a map from Billera-Vogtman- 
Holmes tree space to its tropical varieties. We employ the same method used in jM2) . the language of 
branching algebras and branching valuations. Our methods are applicable to reductive groups of other 
types and we intend to work out the space analagous to Kn for type D in a forthcoming publication. 



1.1. tropical structure of dissimilarity vectors. In |M2] our method was to construct the Billera- 
Holmes-Vogtman space of phylogenetic trees Kn as a subfan of the valuations on the Pliicker algebra 
Pm,n- We then employed the following theorem from tropical geometry, for a commutative algebra A 
we denote the space of valuations on A into the tropical line T by Vt(^). 

Theorem 1.5. For any set xi, . . .Xn & A, and any valuation v S Vt(^), the point (w(xi), . . . , v{xn)) € 
T" lies in the tropical variety of the ideal of forms I which vanish on xi, . . . ,Xn in A. 

(6) v{x) e tr{I) 

It was then shown that evaluating the valuation vj- corresponding to a metric tree T G Kn on the 
Pliicker generators Zo- € Pm,n yielded the ?7i^ dissimilarity vector of 7", 

(7) vriza) = d^m 

The valuations vj- were constructed for d"^{T) as a special case of a general method to construct 
valuations on algebras which capture branching information for morphisms of reductive groups. As a 
sum of invariants in n— fold tensor products of irreducible SLmifC) representations, the Pliicker algebra 
can be realized as a subalgebra of the full tensor algebra of SLm{<C). 

(8) P,n,n C R{5n-l) 

This algebra captures the branching data for 5n-i : SLmiC) — ?> ^^^(C)""^, the diagonal embedding, 
and it has all n— fold tensor products of irreducible S'Lm(C) representations as graded summands. 

(9) RiSn-i)= iIomsi„(c)(F(Ao),F(Ai)®...«.F(A„_i)) 

A6A" 

In general, there is a multigraded algebra R{(f>) assigned to a morphism of reductive groups (p : H ^ G 
which algebraically encodes the problem of restricting representations from G to H, we will discuss these 
algebras and their deformations in section [51 In |M2j and [Ml] it was shown that each factorization 
= 0io. . .o0„j of in the category of reductive groups yields a cone B{(j)) C Yt{R{(J))). When coupled 
with Theorem [TT5] above, this gives a method for producing complexes of points on the tropical varieties 
associated to R{4>) and its sub-algebras, which inherit the combinatorial structure of the category of 
reductive groups. In particular, the diagonal morphism Sn-i '■ SLm{'C) — >■ SLm{'C)"'~^ comes with a 
factorization for every tree on n leaves, given by other diagonal morphisms, see |M2| . 

This gives a cone C{'T) for each such tree, and the valuations vj- are special points in this cone, chosen 
for their desireable properties when evaluated on the Pliicker generators. In this way the factorization 
properties of diagonal maps realize dissimilarity vectors of trees by valuations on the Pliicker algebra. 
This is the essence of the proof of Theorem ll.4l from |M2j . Notably this produces many more valuations 
then just those coming from Kn, presumably these hide interesting combinatorial information about 
the trees T, the Pliicker algebra Pm,n and other algebras inside the full tensor algebra. 
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SL^{€) 




-•-- SX,„(C) ^_^ SL 




Figure 3. a factorization of 5?. 



1.2. factorization by general linear subgroups of GLn{C). The Pliicker algebra can also be 
constructed as a subalgebra of the branching algebra R{1g), associated to the identity morphism 
1g : 1 ^ GL„(C). 



(10) 



R^g) = vi\) 



AeA 



Multiplication in this algebra is computed via Cartan multiplication, V{X) ® V{ri) -^ V{\ + rf). This is 
the coordinate ring of GL„(C)/C/+, where C/+ is the subgroup of upper triangular unipotent matrices. 
This is also the Cox ring of the full flag variety, in particular the projective coordinate ring of any flag 
variety G'L„(C)/P, for any ample line bundle £(A) sits inside R{1g) ^s a subalgebra. 



(11) 



^A = ^(^^) ^ ^(1g) 



Af>0 



In this way we obtain the Pliicker algebra is a subalgebra of R{1g) as Pm,n — Ru^- This allows us to 
once again analyze some tropical geometry of the Grassmannian, and indeed any flag variety in type 
A, using the branching construction on factorizations of 1 — > GL„(C). In particular we can obtain a 
cone of valuations in Yj{Ru:^) for any nested sequence of subgroups of GL„(C). Our main tool will be 
nested products of general linear groups. 

For a set of indices a C [n] of size |cr| = fc we define the subgroup v : GLk{C) — > GL„(C) to be the 
subcopy of GLfc (C) on the indices a. We represent this with the model matrix diagram below. 
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Z 



\ 



J 



Here an element of GLs{C) x GL2{C) on the indices {1,2,4} and {3,5} respectively defines an 
element of GL^IC). In this way, every n + 1 tree 7" defines a directed system of general linear groups, 
where an edge e S E{T) defines the subgroup ii(e)(GL/c(C)) C GL„(C) where L{e) is the set of leaves 
I such that the unique path between I and in T passes through e. 

Each vertex v £ V{T) then corresponds to a map iL{ei) x . . . x «L(ej) : Gifci(C) x . . . x GLk^ — > 
GLm(C), where ki = |i(ei)|, and m = \L{e)\. In particular, the leaves of T are assigned the corre- 
sponding copy of C* C GLn (C) at the proper index in the diagonal. 

Following the branching construction, we get a valuation on the algebra R{1g) by assigning a function 
p, linear on the weights of the appropriate GLk (C) to each edge of this tree. We need p to act in such 
a way that every dominant weight receives a positive real value, and for every pair of dominant weights 
A, r], the weights appearing in V{\) ® V{r]) receive weight less than or equal to p(A) + p{ri) = p{X + -q). 
We choose the functional p that counts the number of boxes in the top row of the Young Diagram of 
the associated weight A. 
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GLsiC) 




Figure 4. a factorization of 1gl3(C) 
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Figure 5. weight of a Young tableaux 

This particular functional has the nice property that p{uJi) = 1 for i = I, . . . ,n (recall that w„ — det). 
In this way, a metric tree 7" with topological structure equal to the chosen structure defines a valuation 
wr on R{1g) by the assignment of i{e)p to e 6 E{T). 

1.3. branching diagrams. In section [2] we will use this to prove the following proposition about the 
exterior forms, which form a basis of V{LJi). 

Theorem 1.6. Let T he an n + 1 tree, and z^ = Zi^ /\ . . . h Zi^ G /\"*(C"). Then wq-{za) = do^aiT). 

Note that the vector associated to the PlUcker generators of Pm,n by this method is not the complete 
TO— dissimilarity vector of T, as each component always contains an index 0. We are not re-proving 
Theorem II. 4[ but we are showing that these specialized components of dissimilarity vectors are also 
solutions to tropical equations coming from Grassmannians. Using the branching algebra -R(Ig) also 
offers enough flexibility to allow us to say something about general flag varieties, and even general 
algebras with a GL„(C) action. 

Recall that a representation V{X) of G'i„(C) has a basis given by semistandard fillings T of the 
Young diagram associated to the weight A. Let zt be the basis member of V{X) associated to the filling 
T. Each T can be viewed as a collection of sets of indices, corresponding to the columns of T. 



(12) 



r = [ai,. 



jO-fcJ 



Indeed, the element zt is the image of the tensor Zg-i 
tations. 



) Za-^ under the unique map of represention- 



(13) 



TTx : ^(w|.i|) ® . . . ® n^k.l) -^ ViJ2^\a,\) = ViX) 



This map is the multiplication map in R{1g), so from general properties of valuations we get the 
following corollary. We define do,T(7") = J2a eT^o,aiiT). 
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Corollary 1.7. For any semistandard tableaux T with associated element zt G ^(1g) we have the 
following equation. 

(14) wrizr) = do,T{T) 

This gives a map from Kn+i to the tropical varieties of any projective coordinate ring of any line 
bundle over any flag variety of type A, or indeed any algebra with a standard tableaux generating set. 
We also get the following generalization of Theorem 11.61 

Corollary 1.8. Let f Cz I\ be an element of the ideal which vanishes on the semistandard tableaux 
generators of the projective coordinate ring R\, associated to C{X) of a flag variety GL„(C)/P(A). Then 
the tableaux dissimilarity components rfo.Ti, as Ti runs over the basis ofV{X), satisfy T{f). 

In [M2' we showed a very general result on G— algebras, Theorem 3.5. A consequence of this is that 
any complex of valuations arrising from the branching valuation construction on R{1g) also passes to 
a complex of valuations on an algebra A with a rational GL„(C) action. 

Theorem 1.9. Let A be a comm,utative algebra with a rational action by GLn{<C), then there is a map 

Kn+l -^ Vt(A). 

Just as each R\ defined its own set of tropical invariants of trees T G Kn+i in the form of the 
tropicalizations of semi-standard tableaux, any generating set X C A oi such an algebra will define 
a set of tropical invariants of T which satisfy the tropicalized equations in the ideal defined by the 
presentation of A by X. 

2. OVERVIEW OF BRANCHING VALUATIONS 

Here we give an overview of the essential ideas used to proved Theorem 11.61 For more on branching 
valuations we direct the reader to |M2) and [Ml] . For a map of reductive groups (f) : H ^ G we define 
a commutative algebra over C. 

(15) Ricj)) ^ [Rh (^ Rg]" 

Here invariants are defined with respect to the action of H on Rh and H on Rq through </>. As a 
multigraded vector space, this algebra can be formulated as follows, 

(16) i?(0)= HomH{V{v),V{X)) 

where the sum runs over all pairs of dominant H and G weights. Now we factor by a pair of morphisms 
in the category of reductive groups. 

H -^^ K — ^-^ G 
From this factorization we may refine the multigrading presented above. 

(17) HomHiV{rj),V{X)) = i/om^ (T/(r,), T/(t)) ® HomKiV{r^), V{X)) 

t€Ak 

This is a formal consequence of the semisimplicity of the categories of finite dimensional represen- 
tations of reductive groups. Notice that the components on the right hand side can be viewed as 
summands in R{ip) ® R{t:). We simplify notation a little and rename the above summands. 

(18) W^(?7, r, A) = HomHiVif]), V{t)) ® HomK{V{ri), V{X)) 
The following theorem can be found in [Ml] and |M2] . 
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Theorem 2.1. The following holds under multiplication in the algebra R{(f>). 
(19) W{m,n,X,)xW{v2,T2,X2)c W{m+r,2,T,X, + X2) 

T<Ti+T2 

Moreover, when projected onto the highest weight component on the right hand side, the resulting mul- 
tiplication operation agrees with multiplication of the corresponding components in R{ip) O R{'k). 

The same construction can be made for any chain of group morphisms. 

H -li-^ K^ -J^ . . . -i^ K^ -i!=±l^ G 

This results in a multifiltration of i?((/)) by tuples of dominant weights from Ki, . . . Km- We can turn 
this into a valuation in a number of ways, each given by a choice of functional pi on the dominant 
weights for each Ki which assigns the largest number to ti + T2 in the sum in the theorem above. 
There is a cone B{(j)) of these functionals, its properties are discussed in [Ml] . The cones for different 
factorizations fit together into a complex K^j, of valuations on R{(j>), this is also discussed in IMlj . The 
important point here is that this complex inherits combinatorial properties naturally from the category 
of reductive groups. 

In order to evaluate a valuation v^ on an element / G i?(0) one must compute the branching diagram 
of /. This means we must decompose / into its homogenous components along the multifiltration as in 
Equation[THl This entails recording the dominant weights of / considered as a vector in a representation 
of each group Ki. Once this is done, the value of the valuation is given by applying the functional p 
to the resulting tuple of dominant weights. In section [3] below we do this for our tree valuations and 
members of the basis of Rgl„ (C) defined by the semistandard tableaux. 

3. DISSIMILARITY VECTORS AND SEMISTANDARD TABLEAUX 

In this section we compute branching diagrams for the basis B{X) C V{X) of semistandard tableaux 
of an irreducible representation of GLn(C) for the branching by given by an n + 1 tree T constructed 
in the introduction. We then explain how to produce a valuation wj- on R{1g) from a metric on T, 
and compute this valuation on members of the basis B(X), this will prove Theorem 11.61 and Corollary 

We begin by computing the branching diagrams of the exterior forms Zg-, \a\ = m, which constitute 
a basis of /\'"(C"). This representation branches in a very simple way over the subgroup GLk{C) x 
GLn-ki'C) C GL„(C) corresponding to the decomposition C" = C'^ © C"~'^. We have the following. 

m i k 

(20) A('^")= A('^')®A('^""') 

i+j—Tn 

Let landJ be the subsets of the index set [n] defined by the above direct sum decomposition, then 
in terms of the exterior form basis of each space involved we have, 

o-n/l |crnj| 

(21) z^ = z^ni A z^n,/ C /\ (C^) ® /\ (C""''^). 

The result of this step is an exterior form Zg-n/ on each new branch of the diagram, so the rest of the 
branching diagram is computed in the same way. This observation proves the following. 

Proposition 3.1. For an n + 1 tree T, an edge e £ EiT), and an exterior form z„ £ /\ (C"), the 
branching diagram of z^ has a tableaux of shape [1, . . . , 1, 0, . . . 0] at e, where the number of V s is equal 
to \L[e) ncr|, and it is thought of as a dominant weight for GL\i^t^\i{C). 

For T a semistandard tableaux, we can represent zt G V{X), \T\ = A as a tensor product of exterior 
forms. Let T — [ai, . . . , CTfe], then there is a surjection tt\ : V{uj^„-^^) . . . (g) V{uj^cr^\) — > ^(A), such that 
the following holds. 

(22) ZT = TT\{Zai <E) ■ . ■ (E) Zc,^) 
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Figure 6. trivalent tree 



Corollary 3.2. For an n + 1 tree T, and element zt the branching diagram of zt is the sum of the 
branching diagrams for the z^^ , defined above. 

U 



Proof. This follows from the general version of Theorem 11.61 above. 

This corollary establishes enough for us to evaluate our branching valuations on the basis zt G 
B{X) C V{X). Once again following section[2l we now produce the valuation wj- associated to a metric 
tree T- From the previous section, it suffices to assign a coweight to each edge e G E{T). 

Definition 3.3. Let p be the coweight which counts the number of boxes in the first row of the tableaux 
representing a weight A. The valuation wj- is defined by assigning £{e)p to the edge e, where £{e) is the 
length of the edge e G E[T). 

It is then clear that w-j-{z„) gives the sum of the lengths of the edges e which appear in the com- 
binatorial convex hull of the indices cr U {0}, as these are the only edges with non-0 branching weight, 
and the coweight p assigns each non-zero weight in this diagram a 1. This proves Theorem 11.61 

Even though we are taking only some of the components of the dissimilarity vectors of an n + 1 tree 
T, we can still recover the structure of T from this information. In fact, it is enough to just have the 
vj- evaluations of the degree 1 and degree 2 Phicker coordinates. As defined, this set already includes 
all of the 2-components doj, and it is simple to verify the following. 



(23) dij — 2dQ^ij — do^i — doj 

Since the 2-disimilarity vector of a tree completely determines the metric structure, this shows that 
w-y-{zi) and W'T-{zi A Zj) determine T. Theorem 11.61 above also implies that the rooted dissimilarity 
numbers do,(T satisfy many combinatorial tropical equations. 
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